Journals
  Publication Years
  Keywords
Search within results Open Search
Please wait a minute...
For Selected: Toggle Thumbnails
Short text classification using latent Dirichlet allocation
ZHANG Zhifei MIAO Duoqian GAO Can
Journal of Computer Applications    2013, 33 (06): 1587-1590.   DOI: 10.3724/SP.J.1087.2013.01587
Abstract2354)      PDF (555KB)(3465)       Save
In order to solve the two key problems of the short text classification, very sparse features and strong context dependency, a new method based on latent Dirichlet allocation was proposed. The generated topics not only discriminate contexts of common words and decrease their weights, but also reduce sparsity by connecting distinguishing words and increase their weights. In addition, a short text dataset was constructed by crawling titles of Netease pages. Experiments were done by classifying these short titles using K-nearest neighbors. The proposed method outperforms vector space model and topic-based similarity.
Reference | Related Articles | Metrics
Rough set based attribute reduction with consistent confidence
GAO Can MIAO Duo-qian ZHANG Zhi-fei ZHANG Hong-yun
Journal of Computer Applications    2012, 32 (04): 1067-1069.   DOI: 10.3724/SP.J.1087.2012.01067
Abstract1026)      PDF (612KB)(398)       Save
In order to solve the problem of reduction anomaly in the existing probabilistic rough set models, non-parameterized and parameterized maximum decision entropy measures for attribute reduction were proposed by using the concept of maximum confidence of uncertain object. The monotonicity of the parameterized maximum decision entropy was explained and the relationship between its attribute reduction and other ones was analyzed. The definitions for core and relatively dispensable attributes in the proposed model were also given. Moreover, non-parameterized and parameterized confidence discernibility matrixes were put forward and the difference of classical discernibility matrix and the proposed ones in charactering the uncertain object were discussed. Finally, a case study was given to show the validity of the proposed model.
Reference | Related Articles | Metrics